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Abstract 

The Correlates of War project scrupulously collects the information about disputes 
between the countries over a long historical period together with other data relevant 
to the character and reasons of international conflicts. Using methods of modern 
Data Science implemented in the R statistical software, we investigate the datasets 
from the project. We study political, economic, and religious factors with respect to 
the emergence of conflicts and wars between the countries. The results obtained lead 
to certain conclusions about variances and causalities between the factors considered. 
Some unpredictable features are presented. 
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1 Introduction 

Here we study datasets from the Correlates of War (COW) project which provides accu¬ 
rate and reliable quantitative data in international relations, in particular, in the disputes 
between countries over a long historical period together with other information relevant to 
the character and reasons of the conflicts. 

Diplomatic, economic, and religious aspects are among the factors affecting emergence 
of international tensions. In this paper we have investigated them using statistical methods 
implemented in the R data analysis software. 

2 Distribution of countries over the number of their wars 

Before looking at the above-depicted factors, let us range the countries on their “belliger¬ 
ence”, i. e., the number of wars they have been involved in throughout the period 1816-2010 
[3]. The distribution turns out normal at whatever level of statistical significance as sug¬ 
gested by the results of four basic normality tests executed on R: 
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Figure 1: Histogram - distribution of countries (on their codes) by the numbers of wars 
they have been involved in over the period 1816-2010 together with Q-Q plot 
for this distribution 


> shapiro.test(no_of_wars) 

Shapiro-Wilk normality test 

data: no_of_wars 
W = 0.55004, p-value < 2.2e-16 

> library(nortest) 

> ad.test(no_of_wars) 

Anderson-Darling normality test 

data: no_of_wars 
A = 26.233, p-value < 2.2e-16 

> cvm.test(no_of_wars) 

Cramer-von Mises normality test 

data: no_of_wars 
W = 4.9752, p-value = 7.37e-10 

Warning message: 

In cvm.test(no_of_wars) : 

p-value is smaller than 7.37e-10, cannot be computed more accurately 

> lillie.test(no_of_wars) 

Lilliefors (Kolmogorov-Smirnov) normality test 

data: no_of_wars 
D = 0.28451, p-value < 2.2e-16 
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Yet, as shown by the Q-Q plot at Fig. [lj there is a good deal of “outliers” which turned 
out involved in the largest number of wars. The top 12 most “war-full” countries are thus 
USA, Russia, UK, China, Germany, France, Japan, Turkey, Iran, Italy, Is¬ 
rael, and Iraq. They swerve from the normal distribution of the rest of the countries. 
This is quit a natural list since those are the countries with complicated history (partici¬ 
pants of both world wars or parts of regions with traditionally high military activity). 

Evidently, these countries are of the most interest regarding the studies of the wars 
reasons. Later on we will then focus on the countries from this list. 


3 Examining principal factors of wars 

3.1 Political relations at times of war and peace 

Variables. To estimate the value of diplomacy in disputes between a pair of countries, 
two factor variables were considered: Diplomatic representation level (DR) of side 2 at side 
1 and vice versa, and Any diplomatic exchange ( DE ) between sides 1 and 2 [2j - DR can 

have following values: 

/ 0=no evidence of diplomatic exchange 
/ l=charge d’affaires 
/ 2=minister 
/ 3=ambassador 
/ 9=other 

Measurement scales, missing values treatment. The last level of DR looks like an 
empty value and needs to be handled accordingly. Level 9 means “interest sections, inter¬ 
ests served by another country, address only” etc. [2J, i.e., kind of “minor” representation. 
It seemed thus reasonable to rearrange the levels in such a way that level 9 took the place 
between levels 0 and 1. Since then the variable was treated as ratio rather than nominal. 

The DE variable may equal: 

/ 0=neither side was represented in the other side 
/ l=at least one side was represented in the other side 
It was also treated as a ratio variable. 

We thus consider the variance of DR and DE with respect to two categorical variables, 
viz. a country (1 or 2, the countries - a dispute opponents), and a state (war or peace). 
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Table 1: Results of tests of dependencies between diplomatic relations of a few selected 
countries with their opponents at times of peace and war. As per the standard 
notation symbols ‘***’, '**’, £ *’, V to the right of the numbers mean statistical 
significance at levels 0.1%, 1%, 5%, 10%, respectively 



Representation levels 

on both sides (DR) 


DE 


country - its opponents 

war-peace interaction 

war-peace 

Top-12 list 

countries: 

USA 

1.83e-07 *** 2.65e-08 *** 

0.08. 

2e-16*** 

Russia 

5.38e-06*** 

2.2e-16*** 

0.71 

1.08e-09*** 

UK 

7.62e-ll*** 

0.061. 

0.52 

0.20 

China 

0.20 

0.70 

0.94 

0.23 

Germany 

2.2e-16*** 

0.0015** 

0.72 

0.00087*** 

Others: 

NorthKorea 

0.73 

0.13 

0.90 

0.073. 

Libya 

0.92 

0.0061** 

0.75 

0.56 

Somalia 

9.3e-05*** 

0.42 

0.98 

0.049* 


Perform a two factor analysis of variance (ANOVA) jTJ to investigate the relations of 
interest. 

Instead of investigating the countries pairwise, we chose a few of them and considered 
each versus all the countries it ever had disputes with in 1817-2005. Such a method gave 
much larger and homogeneous data samples compared to the pairwise approach, which 
better satisfied the ANOVA assumptions (see the next subsection). 

Test results. Table [l] shows the results of the two factor ANOVA test on the relation 
between DR and the two chosen factors. These factors are: a certain country on the one 
hand and all its opponents on the other hand, and a war/peace state. Also shown is the 
significance of correlations with the combination of the first and second factors. Again, the 
one factor correlation is presented of the DE variable (common for both sides) with the 
war/peace state. 

Evidently, in most of the cases there indeed is a statistically significant relation between 
diplomatic activities on any side and the state (war/peace) the opponents are in. Such a 
significance was observed for a much larger number of countries examined. 
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Exception. Note, however, a couple of rare violations of this trend: absence of statistical 
significance in diplomacy of China and North Korea, even though both have pretty long 
peaceful and tense history. What is a common between these countries? In particular, 
a strong political isolation at times of communist past and present. Perhaps, this is the 
reason of low diplomatic response to the war/peace state in these cases. 

3.1.1 Meeting the ANOVA assumptions 

For a correct use of ANOVA following assumptions are known to be satisfied: 
normality 

homogeneity of variance 
independence of observations 

In that regard, all the tests were accompanied by checking the normality of distributions 
withing the samples, as well as their covariances. In virtually all the cases the normality 
was great, like in the case “China-opponents”: 

> library(nortest) 

> lillie.test(samplel) 

Lilliefors (Kolmogorov-Smirnov) normality test 
data: sample1 

D = 0.25952, p-value < 2.2e-16 
# . . . 

data: sample2 

D = 0.24207, p-value < 2.2e-16 
# . . . 

data: sample3 

D = 0.29846, p-value < 2.2e-16 
# . . . 

data: sample4 

D = 0.24584, p-value < 2.2e-16 
# . . . 

data: sample_general 
D = 0.49108, p-value < 2.2e-16 

The variances of the four groups of DR values (corresponding to all the combinations of 
values “side 1, side 2”—“war, peace”) were comparable like in the case involved: 

variances: 
sample1= 1.33 
sample2= 1.58 
sample3= 1.12 
sample4= 1.49 

This evidently assures homogeneity of the samples variance. 

As to the independence of observations, it is guaranteed the by standard scientific prin¬ 
ciples of the data collection on the COW project. 
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3.1.2 All the countries. 


Finally, we performed a general test. For the two factor ANOVA we also had fonr samples 
like the ones mentioned in subsection |3.1.1 They were such that sides 1, 2 involved the 


diplomatic activity data (DR) for the years of disputes of any country against all the 
countries it ever was in the state of a dispute, as well as the data for the correspondent 
pairs of the countries at peaceful times. 


> anova( lm(dip_presence ~ countries*state)) 


Analysis of Variance Table 


Response: dip_presence 

countries 

state 

countries:state 
Residuals 


Df Sum Sq Mean Sq F value Pr(>F) 
1 0 0.00000 0.000 1.0000 

1 0 0.09898 0.063 0.8019 

1 0 0.00000 0.000 1.0000 

263880 414910 1.57234 


For the DE variable, common for a pair of opponents, the results of the one factor test 
are as follows: 


anovaC lm(dip_exchange ~ state)) 


Analysis of Variance Table 
Response: dip_exchange 


state 

Residuals 


Df Sum Sq Mean Sq F value Pr(>F) 

1 3 3.4820 20.384 6.337e-06 *** 

263880 45075 0.1708 


Signif. codes: 0 '***' 0.001 '**' 0.01 0.05 0.1 ' ' 1 


Analysis. As per the test results, the general level of diplomatic activity of opponents 
(DE) vs. the war/peace state is the only statistically significant relation. Apparently, 
this means that the different opponents in different years could have different structures 
of political activity with respect to each other but these structures are “averaged” over the 
large number of countries and years without yielding any characteristic pattern, common 
for all the countries. 

What is indeed common for all the countries, it is the sensitivity of the joint level of their 
diplomatic activity to essential conditions of the interstate relations dictated by essential 
circumstances — war or peace (DE vs. war/peace). 


3.2 Economic state at times of war and peace 

Among a great number of the countries’ economic life characteristics provided by the 
“economic” datasets [7J, we chose the common aggregated parameter, Composite Index of 
National Capability (CINC). 
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Table 2: Results of tests on dependencies between Composite Index of National Capability 
of a few selected countries vs. that of their opponents at times of peace and war 



country - its opponents 

war-peace 

interaction of both factors 

Countries from 

top-12 list: 

USA 

2.2e-16*** 

0.062. 

0.017* 

Russia 

2.2e-16*** 

0.00117** 

0.00104** 

UK 

2.2e-16*** 

0.018* 

0.45 

China 

2.2e-16*** 

1.9e-13 *** 

2.2e-16*** 

Germany 

2.2e-16*** 

0.010* 

0.97 

Others: 

NorthKorea 

2.2e-16*** 

1.84e-07*** 

5.2e-07*** 

Libya 

2.0e-16*** 

0.077. 

0.099. 

Somalia 

2e-16*** 

0.0094*** 

0.011* 


Non-available values. A small number of NAs was found. Since CINC may change 
essentially throughout the historical period, instead of traditional replacement of NAs by 
a median or mean value, it was found reasonable to use a valid CINC of the nearest year 
for the replacement. 

Results of the tests. Table [2] demonstrates the results of two factor ANOVA for the same 
set of countries as in the diplomatic case. We see that the statistical significance here is 
evident and even better then in Table [I] This makes consider the economic factor most 
objective indicator of the quality of interstate relations. 

Let us visualize this significance by plotting the differences of CINC for the four selected 
pairs of the countries over the whole periods of the relations between them in wai0 and 
peace (Fig. [2]). 

In our view, the orange points of war are concentrated in the regions of extremely steep 
slopes of the plotted dependencies, or when a dependency sharply changes its character. 
Such is the CTAC-peace/war correlation. 

Differently looks case (c). We see there pretty abrupt oscillations of differences between 
the economic potentials near zero. Probably, one could say about an economic rivalry 

lu A war between Russia and USA” sounds dreadfully V. It should be noticed, however, that the COW 
project treats “war” in a generalized way, using the term “dispute”. This can imply, for example, show 
of force without direct militarized actions. Such was, for instance, Cuban Missile Crisis of 1962. 
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Figure 2: Differences of economic states between countries-opponents 


correlated with a military one as a specific point of the relations between Israel and Egypt. 

3.3 Factor of religion 

Nowadays religious aspects of international tension is a matter of great public concern. 
That is why this factor is duly considered here. 

The Correlates of War provide a variety of data relevant to this subject as well [6j. 
For the study we took two values: the general percentage of religious adherents in the 
population of a particular country, and the percentage of Islam adherents. Both values are 
available for every half-decade period since 1945 up to 2010. 

Fig. 0 shows relations between those two percentages and the number of wars the 
countries participated in. 

Plot (a) points out surprisingly large part of the countries which had highly religious pop¬ 
ulation during last 65 years. In its turn, this part contains a few countries with extremely 
large number of wars in their history during this period. 

Looking at plot (b) we can conclude that high involvement in wars is by no means a 
general feature of Islamic states. However, there are six countries of this kind with really 
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no of wars 



(a) total percentage of religious adherents (b) percentage of Muslim adherents 

Figure 3: Countries’ involvement in wars in connection with the religious factors 


substantial history of disputes: Iran, Iraq, Turkey, Syria, Egypt, and Pakistan. 


4 Conclusions 

■s 1 ' The tests of normality have distinguished a dozen countries with complicated history 
whose involvement in wars has been essentially different from the rest of the world. 
In our research we have been making a particular emphasis on these countries. 

The diplomatic activity has turned out to significantly depend on a particular country 
at times of both war and peace. The tests of the whole of the countries have shown 
that any patterns specific for a particular state are “averaged” so there is no evidence 
of any commonly preferred ways to conduct foreign policy at times of war or peace. 
However, the degree of joint diplomatic activity has proved to substantially depend 
upon the level of interstate tension throughout two centuries of the world history. 

bs" As to the economic factor, a strong correlation has been revealed between the differ¬ 
ence of the opponents’ Composite Indexes of National Capability and the character 
of relations between pairs of the countries. Visualization of this correlations points to 
an onset of a dispute at the moments when the difference in the countries’ economic 
health rapidly changes. 

bs* Finally, the religious factor has been shown to significantly correlate with the war/peace 
conditions. It turns out that countries with higher percentage of religious adherents 
have been more involved in wars during the last 65 years. As to the Islamic factor , it 
looks like it hardly affects military activity greatly per se. High involvement in wars 
of six large Islamic countries is evidently caused by the combination of their unique 
politics, economics, and culture. 
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No doubt, the great number and variety of Correlates of War data contain more features, 
relations, and causalities. So the continuation of this research may follow in some future. 
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